The IBM Personal Speech Assistant
نویسندگان
چکیده
In this paper, we describe technology and experience with an experimental personal information manager, which interacts with the user primarily but not exclusively through speech recognition and synthesis. This device, which controls a client PDA, is known as the Personal Speech Assistant (PSA). The PSA contains complete speech recognition, speech synthesis and dialog management systems. Packaged in a hand-sized enclosure, of size and physical design to mate with the popular Palm III personal digital assistant, the PSA includes its own battery, microphone, speaker, audio input and output amplifiers, processor and memory. The PSA supports speaker-independentEnglish speech recognition using a 500-word vocabulary, and English speech synthesis on an arbitrary vocabulary. We survey the technical issues we encountered in building the hardware and software for this device, and the solutions we implemented, including audio system design, power and space budget, speech recognition in adverse acoustic environments with constrained processing resources, dialog management, appealing applications, and overall system architecture.
منابع مشابه
IBM MASTOR System: Multilingual Automatic Speech-To-Speech Translator
In this paper, we describe the IBM MASTOR, a speech-to-speech translation system that can translate spontaneous free-form speech in real-time on both laptop and hand-held PDAs. Challenges include speech recognition and machine translation in adverse environments, lack of training data and linguistic resources for under-studied languages, and the need to rapidly develop capabilities for new lang...
متن کاملDeveloping a voice-spelling alphabet for PDAs
A persistent problem with personal digital assistants (PDAs) is the difficulty of entering data into the devices. The best current solutions to the problem are small soft keyboards and constrained handwriting recognizers. Another solution is use of speech. PDAs do not yet have the power to support full speech dictation, but they do have sufficient power to support voice spelling. Voice-spelling...
متن کاملMobile Reading Assistant for Blind People
This paper describes an embedded device dedicated for blind or visually impaired people. The main aim of this system is to build an automatic text reading assistant using existing hardware associated with innovative algorithms. A personal digital assistant (PDA) was chosen because it combines small-size, computational resources and low cost price. Three key technologies are necessary: text dete...
متن کاملAdaptation of the AhoTTS Text to Speech System to PDA Platfoms
This paper presents the work carried on to adapt a Basque language Text-To-Speech (TTS) system into a mobile device of limited resources. The aim is to make possible the use of the AhoTTS conversion system of the UPV/EHU ́s Aholab group, in a Personal Digital Assistant (PDA), and to test the system performance in several aspects, such as sound sample generation times. The selected PDA is a Pocke...
متن کاملTeleradiology on a Personal Digital Assistant
This paper describes the porting of a teleradiology system to a Personal Digital Assistant (PDA). The basis for this formed the CHILI teleradiology and PACS system developed by the Steinbeis Transferzentrum Medizinische Informatik, Heidelberg (STZ) in cooperation with the German Cancer Research Center. The work was done as part of a EU IST project called Multimedia Terminal Mobile (MTM). The au...
متن کامل